NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

On a Scale of 1 to 5, How Reliable Are AI User Studies? A Call for Developing Validated, Meaningful Scales and Metrics about User Perceptions of AI Systems

Tolsdorf, Jan; Luo, Alan F; Kodwani, Monica; Eum, Junho; Sharif, Mahmood; Mazurek, Michelle L; Aviv, Adam J (May 2025, 9th Workshop on Technology and Consumer Protection (ConPro ’25))

Full Text Available
Training Robust ML-based Raw-Binary Malware Detectors in Hours, not Months

https://doi.org/10.1145/3658644.3690208

Lucas, Keane; Lin, Weiran; Bauer, Lujo; Reiter, Michael K; Sharif, Mahmood (December 2024, ACM)

Full Text Available
DrSec: Flexible Distributed Representations for Efficient Endpoint Securit

Sharif, Mahmood; Datta, Pubali; Riddle, Andy; Westfall, Kim; Bates, Adam; Ganti, Vijay; Lentz, Matthew; Ott, David (May 2024, Proceedings of The 45th IEEE Symposium on Security and Privacy (IEEE SP))

Full Text Available
Group-based Robustness: A General Framework for Customized Robustness in the Real World

https://doi.org/10.14722/ndss.2024.24084

Lin, Weiran; Lucas, Keane; Eyal, Neo; Bauer, Lujo; Reiter, Michael K.; Sharif, Mahmood (February 2024, Network and Distributed System Security Symposium)

Machine-learning models are known to be vulnerable to evasion attacks, which perturb model inputs to induce misclassifications. In this work, we identify real-world scenarios where the threat cannot be assessed accurately by existing attacks. Specifically, we find that conventional metrics measuring targeted and untargeted robustness do not appropriately reflect a model’s ability to withstand attacks from one set of source classes to another set of target classes. To address the shortcomings of existing methods, we formally define a new metric, termed group-based robustness, that complements existing metrics and is better suited for evaluating model performance in certain attack scenarios. We show empirically that group-based robustness allows us to distinguish between machine-learning models’ vulnerability against specific threat models in situations where traditional robustness metrics do not apply. Moreover, to measure group-based robustness efficiently and accurately, we 1) propose two loss functions and 2) identify three new attack strategies. We show empirically that, with comparable success rates, finding evasive samples using our new loss functions saves computation by a factor as large as the number of targeted classes, and that finding evasive samples, using our new attack strategies, saves time by up to 99% compared to brute-force search methods. Finally, we propose a defense method that increases group-based robustness by up to 3.52 times.
more » « less
Full Text Available
Adversarial Training for Raw-Binary Malware Classifiers

Lucas, Keane; Pai, Samruddhi; Lin, Weiran; Bauer, Lujo; Reiter, Michael K.; Sharif, Mahmood (August 2023, 32nd USENIX Security Symposium)

Machine learning (ML) models have shown promise in classifying raw executable files (binaries) as malicious or benign with high accuracy. This has led to the increasing influence of ML-based classification methods in academic and real-world malware detection, a critical tool in cybersecurity. However, previous work provoked caution by creating variants of malicious binaries, referred to as adversarial examples, that are transformed in a functionality-preserving way to evade detection. In this work, we investigate the effectiveness of using adversarial training methods to create malware-classification models that are more robust to some state-of-the-art attacks. To train our most robust models, we significantly increase the efficiency and scale of creating adversarial examples to make adversarial training practical, which has not been done before in raw-binary malware detectors. We then analyze the effects of varying the length of adversarial training, as well as analyze the effects of training with various types of attacks. We find that data augmentation does not deter state-of-the-art attacks, but that using a generic gradient-guided method, used in other discrete domains, does improve robustness. We also show that in most cases, models can be made more robust to malware-domain attacks by adversarially training them with lower-effort versions of the same attack. In the best case, we reduce one state-of-the-art attack’s success rate from 90% to 5%. We also find that training with some types of attacks can increase robustness to other types of attacks. Finally, we discuss insights gained from our results, and how they can be used to more effectively train robust malware detectors.
more » « less
Full Text Available
Adversarial training for raw-binary malware classifiers

Lucas, Keane; Pai, Samruddhi; Lin, Weiran; Bauer, Lujo; Reiter, Michael K.; Sharif, Mahmood (August 2023, USENIX Security Symposium)

Machine learning (ML) models have shown promise in classifying raw executable files (binaries) as malicious or benign with high accuracy. This has led to the increasing influence of ML-based classification methods in academic and real-world malware detection, a critical tool in cybersecurity. However, previous work provoked caution by creating variants of malicious binaries, referred to as adversarial examples, that are transformed in a functionality-preserving way to evade detection. In this work, we investigate the effectiveness of using adversarial training methods to create malware-classification models that are more robust to some state-of-the-art attacks. To train our most robust models, we significantly increase the efficiency and scale of creating adversarial examples to make adversarial training practical, which has not been done before in raw-binary malware detectors. We then analyze the effects of varying the length of adversarial training, as well as analyze the effects of training with various types of attacks. We find that data augmentation does not deter state-of-the-art attacks, but that using a generic gradient-guided method, used in other discrete domains, does improve robustness. We also show that in most cases, models can be made more robust to malware-domain attacks by adversarially training them with lower-effort versions of the same attack. In the best case, we reduce one state-of-the-art attack’s success rate from 90% to 5%. We also find that training with some types of attacks can increase robustness to other types of attacks. Finally, we discuss insights gained from our results, and how they can be used to more effectively train robust malware detectors.
more » « less
Full Text Available
Privacy-Preserving Collaborative Genomic Research: A Real-Life Deployment and Vision

https://doi.org/10.1145/3689942.3694747

Rahmani, Zahra; Shahini, Nahal; Gat, Nadav; Yun, Zebin; Jiang, Yuzhou; Farchy, Ofir; Harel, Yaniv; Chaudhary, Vipin; Ayday, Erman; Sharif, Mahmood (November 2023, ACM)

Full Text Available
Scalable verification of GNN-based job schedulers

https://doi.org/10.1145/3563325

Wu, Haoze; Barrett, Clark; Sharif, Mahmood; Narodytska, Nina; Singh, Gagandeep (October 2022, Proceedings of the ACM on Programming Languages)

Recently, Graph Neural Networks (GNNs) have been applied for scheduling jobs over clusters, achieving better performance than hand-crafted heuristics. Despite their impressive performance, concerns remain over whether these GNN-based job schedulers meet users’ expectations about other important properties, such as strategy-proofness, sharing incentive, and stability. In this work, we consider formal verification of GNN-based job schedulers. We address several domain-specific challenges such as networks that are deeper and specifications that are richer than those encountered when verifying image and NLP classifiers. We develop vegas, the first general framework for verifying both single-step and multi-step properties of these schedulers based on carefully designed algorithms that combine abstractions, refinements, solvers, and proof transfer. Our experimental results show that vegas achieves significant speed-up when verifying important properties of a state-of-the-art GNN-based scheduler compared to previous methods.
more » « less
Full Text Available
Constrained Gradient Descent: A Powerful and Principled Evasion Attack Against Neural Networks

Lin, Weiran; Lucas, Keane; Bauer, Lujo; Reiter, Michael K.; Sharif, Mahmood (July 2022, The 39th International Conference on Machine Learning)

Full Text Available
Constrained Gradient Descent: A Powerful and Principled Evasion Attack Against Neural Networks

Lin, Weiran; Lucas, Keane; Bauer, Lujo; Reiter, Michael K.; Sharif, Mahmood (July 2022, Proceedings of Machine Learning Research)

Full Text Available

« Prev Next »

Search for: All records